Multi-Stage Complex Contagions 
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The spread of ideas across a social network can be studied using complex contagion models, 
in which agents are activated by contact with multiple activated neighbors. The investigation of 
complex contagions can provide crucial insights into social influence and behavior-adoption cascades 
on networks. In this paper, we introduce a model of a multi-stage complex contagion on networks. 
Agents at different stages — which could, for example, represent differing levels of support for a social 
movement or differing levels of commitment to a certain product or idea — exert different amounts 
of influence on their neighbors. We demonstrate that the presence of even one additional stage 
introduces novel dynamical behavior, including interplay between multiple cascades, that cannot 
occur in single-stage contagion models. We find, for example, that cascades — and hence collective 
action — can be driven not only by higher-stage influencers but also by lower-stage influencers. 



Social movements and other forms of collective action, 
which often arise spontaneously, require an ensemble of 
supporters with different levels of commitment. Social 
influence and its potential to yield a critical mass of sup- 
porters can make a crucial difference as to whether or 
not movements succeed More generally, the ef- 

fect that other people's opinions and actions have on the 
decisions that we make is a crucial sociological consider- 
ation [U, @] , and the impact that individual influence 
can have on the large-scale spread of rumors, fads, be- 
liefs, and norms via social networks is of particular inter- 
est [7h3(]||. a closely related societal concern is that the 
mechanisms rooted in social interaction can give rise to 
financial crashes [3l| , political revolutions [Hj , successful 
technologies [U 
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and cultural market sensations 
The sudden changes in state exhibited in these ex- 
amples are known as cascades: Initially local behav- 
ior becomes widespread through collective action. The 
perceived similarity between social and biological epi- 
demics [1,[3^ has led to the use of the term contagion for 
the spread of social influence [s^ . Specifically, contagion 
refers to cases in which — much like with a virus or a 
disease — exposure to some source is enough to initiate 
propagation. Importantly, sociological contagions need 
not just spread from one specific source to another. In 
many situations, the chance of a node becoming active 
(e.g., adopting a new technology or joining a political rev- 
olution) depends on several other people who are active 
— and this is particularly true of people who are "close" 
or perceived as close in a social network. These types of 
contagions have been called complex contagions [10l4l2l |. 
Key investigations of complex contagions have examined 
the diffusion of applications on the social networking site 
Faceboo k 1371 . memes (short textual phrases) on news 
websites [38[ and other social media [39[ , information on 
blogs [lO] and on the micro-blogging service Twitter [4l| . 

Although large data sets have the potential to help 
provide a better picture of social contagion, analyzing 



them without accompanying dynamical models offers lit- 
tle hope of distinguishing between underlying causes of 
individual behavior (social influence versus homophily 
versus covariates) Statistical methods have been 

developed to approach the data side of this problem [i^ , 
but mathematical modelling is an underappreciated and 
crucial component of these efforts. In particular, sim- 
ple models make it possible to isolate effects (e.g., social 
contagion) and develop and test quantitative diagnostics 
that characterize macroscopic dynamics. 

Early studies in the sociology of behavioral cascades 
considered threshold models of binary decisions [2lj . In 
these models, agents can switch from an initially inactive 
state to an active state if a sufficient proportion of other 
agents are active. These models capture two important 
features [l|: interdependence (an agent's behavior de- 
pends on the behavior of other agents) and heterogeneity 
(differences in behavior are reflected in the distribution 
of thresholds). In recent studies 0, [43 - l46j . threshold 
models of social influence have been examined on net- 
works in which the nodes correspond to agents and the 
edges between nodes indicate who influences whom. In a 
network setting, models with simple threshold dynamics 
also capture fundamental mechanisms that can be related 
(at least by analogy) to a large range of phenomena that 
include failures in power grids and the transmission of in- 
fectious diseases [13, El] . Tractable models of threshold 
dynamics have also sparked a great deal of interest in the 
physics and applied mathematics communities, because 
many results from graph theory, statistical physics, and 
dynamical systems can be directly applied in this setting. 
For example, such techniques allow one to identify crit- 
ical cascade thresholds H, mean cascade sizes [43L and 
the effects of seed size [ij and network topology pSj . 

Motivated by the observation that not all opinions 
have equal weight, we introduce in this paper a model 
for multi-stage complex contagions in which agents can 
adopt several different states with variable levels of in- 
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flucncc. Almost all existing models assume that active 
agents exert equal influence over their peers. (A no- 
table exception is Ref. [HI, which recently considered 
networked agents with variable levels of social influence 
but used a very different framework from ours and also 
focused on different dynamics.) However, such a binary 
description of agents' states can be woefully inadequate. 
For example, it has been well-documented that support- 
ers with varying level of commitment are crucial in social 
movements [l|, |2| ; regular users of a product are more en- 
thusiastic recommenders than casual users; supporters of 
a political party can vary significantly in their desire and 
ability to recruit new members; there are substantial be- 
havioral differences between fans and fanatics of sports 
teams; and experience with free trial versions of software 
can greatly influence a user's decision to purchase a costly 
commercial version. 

We show schematics of single-stage and two-stage com- 
plex contagion models in Fig. [TJ To focus our discus- 
sion, we consider the simplest multi-stage progression, in 
which there are three possible states [see Fig.[ljb)]: inac- 
tive (no influence; unaware of an innovation or indifferent 
to a social movement), active (low level of influence; test- 
ing an innovation or supporting a social movement), and 
hyper-active (high level of influence; full adoption of an 
innovation or activists in a social movement). 

The rest of this paper is structured as follows. In the 
next two sections, we define our model and illustrate 
some of its novel dynamics. We demonstrate that it is 
possible for one cascade to drive the other one even in sit- 
uations in which there would be no propagation (or only 
small propagation) in the associated single-stage cascade 
model. We then use synthetic networks and analytical 
approximations to further explore the model's dynam- 
ics, and derive conditions for the appearance of cascades. 
Additional details are included in the Appendices. 



I. MULTI-STAGE MODEL 

We consider situations in which there is an underlying 
social network, which we represent using an unweighted, 
undirected graph with N nodes. At any given point in 
time, there are three possible influence levels for each 
node: Inactive nodes exert no influence on their neigh- 
bors and are said to be in state 5*0, active nodes exert 
some influence and are said to be in state Si , and hyper- 
active nodes exert a higher level of influence and are said 
to be in state S2 [see Fig. (Ub)]. Importantly, nodes in 
state S2 are a subset of nodes in state Si , because nodes 
that are S'2-active are necessarily S'l-active (i.e., we con- 
sider fanatics to be a specific type of fan), but 5'i-active 
nodes need not be S'2-active. A natural generalization of 
the two-stage model includes further levels of influence 
such that an ^n-active agent is 5'i-active for all 1 < i < n. 

To specify the model, we also need to indicate pre- 
cisely how nodes influence their neighbors and how the 
neighbors respond to that influence. Accordingly, we 



define the influence response function Fi(rni,m2,k) as 
the probability that a degree-fc node (i.e., a node with k 
neighbors) becomes 5'i-active given that it has mi neigh- 
bors in state 5i and 7712 neighbors in state 52. This 
macroscopic description can be derived directly from a 
microscopic description of response functions of individ- 
ual nodes [47[. Formulating our model in these general 
terms allows us to capture a wide range of types of lo- 
cal interactions between nodes via the detailed form of 
the response functions F,; [l^. In the following section, 
we use a threshold model to illustrate the dynamics of 
multi-stage contagion models. 



(a) Single-stage case 



(b) Multi-stage case 




FIG. 1. (a) Schematic of a single-stage complex contagion. 
All nodes can be either inactive (^o) or active (Si). Nodes 
that are barely above the S'l-threshold have the same level 
of influence as nodes that are strongly above the thresh- 
old, (b) Multi-stage complex contagion. A subset of active 
nodes (called "5'i-active") can become hyper-active (called 
"5'2-active" ) and have additional influence. Note that 52- 
active nodes are necessarily also Si-active. 



II. THRESHOLD MODELS 

We define the "peer pressure" P = {mi -\- /3m2)/k to 
be the total influence received by a degree-A: node from 
its mi neighbors in state Si and TO2 neighbors in state 
52, scaled by the node degree k. Note that our deflnition 
of states implies that a neighbor in state 52 contributes 
1 -|- /3 to the influence, so /3 measures the bonus influ- 
ence exerted by 52-active nodes. It is the presence of 
such bonus influence that distinguishes our multi-stage 
contagion model from single-stage models. 

For threshold models of complex contagion, a node 
becomes 5i-active if P is equal to or exceeds a certain 
threshold Ri (which can be different for each node). 
Therefore, the response function is Fi{mi,m2,k) = 
Ci (P) , where Ci is the cumulative distribution of thresh- 
olds for state Si across network nodes. In this paper, 
we focus mostly on uniform-threshold cases in which Si- 
activation thresholds Ri are the same for all nodes. In 
the uniform-threshold cases, the response functions are 
step functions: 



Fi{mi,m2, k) 



1, if (toi + I3m2)/k > R^ 
0, otherwise 



(1) 



Observe that the response functions Fi(mi,m2,l) can 
be used to model cases in which the received influence 
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is not scaled by a node's degree fc, so an active node in- 
fluences all of its neighbors equally (regardless of their 
degrees). We require that R2 > Ri in order to sat- 
isfy Fi{mi,m2,k) > F2(mi,m2, k), thereby guarantee- 
ing that all S'2-active nodes are also S'l-active. When 
(3 = 0, the S'l-state dynamics reduce to a single-stage 
case because the S'2-active nodes are indistinguishable 
from 5*1 nodes. As an initial condition, we select a small 
fraction (^''^ of nodes to be initially ^i-active, and these 
are not allowed to become less active. At each subse- 
quent time step (of size At = l/iV), we update a ran- 
domly chosen node according to the threshold rules ([T|). 
This implies monotone dynamics — i.e., nodes can never 
become less active than they are currently. 

In single-stage threshold models, it is usually the case 
that if a node needs a number m of active neighbors to 
become active, then any subset of its neighbors with at 
least m active nodes is sufficient to make it active. In 
our multi-stage model, however, there is a heterogeneity 
in the subset of a node's neighbors needed for activa- 
tion. For example, the subset might consist of 4 active 
neighbors, 2 active and 1 hyper-active neighbor, or just 2 
hyper-active neighbors. Moreover, depending on a node's 
threshold values, various possible subsets of neighbors 
can make it active or even hyper-active. 

We now compare single-stage and multi-stage cascade 
dynamics, highlighting cascades that occur in the multi- 
stage models that do not occur in the corresponding 
single-stage models. Because these dynamics cannot 
arise if there are only two types of nodes (inactive and 
active), we see that the presence of hyper-active nodes 
plays a crucial role in driving cascades on networks. We 
start by simulating these dynamics on the Faccbook net- 
work of students at the University of Oklahoma (recorded 
in September 2005 as a single-time snapshot) j50l[5l|. 
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(a) Single-stage case 















Strongly above threshold | 



(b) Multi-stage case 




FIG. 2. Comparison of single-stage [panel (a)] and multi-stage 
[panel (b)] cascade dynamics on the Oklahoma Facebook net- 
work [5y[. Time t is on the horizontal axis, and the frac- 
tions of nodes in each state are indicated on the vertical axis. 
Light blue, blue (and purple), and red regions represent So-, 
Si-, and S2-active nodes, respectively. In panel (a), /3 = 0; 
and in panel (b), /? = 0.5. The threshold parameter values 
are Ri = 0.15 and R2 = 0.3. We use 348 Si-active seed 
nodes (corresponding to 0^^' sa 0.02) and S2-active seeds 
(0'-^^ = 0). There is no cascade in panel (a), but some of 
the nodes (colored in purple) are well above the Si-threshold. 
The bonus influence of S2-active nodes drives a cascade in 
panel (b). 



Note, however that some nodes are well above the activa- 
tion threshold (purple region). Specifically, these nodes 
surpass the threshold R2 = 0.3 but have no additional 
influence. In Fig. [^Jb), S'2-active nodes that surpass the 
higher threshold R2 = 0.3 have 1.5 times as much in- 
fluence as S'l-active nodes. This additional influence is 
enough to trigger a global (system-wide) cascade. 



A. Cascades Driven by High Influencers 

This example illustrates that a small amount of addi- 
tional influence can trigger cascades. This may model, 
for example, the role of a charismatic leader in a social 
movement, or the potential effect of customer product re- 
views on retail websites such as Amazon. In the context 
of our model, the presence of 5'2-active nodes triggers a 
cascade of S'l-active nodes that would not have occurred 
otherwise. In particular, this effect arises speciflcally due 
to the extra influence attributed to S2-active nodes, who 
are significantly above the Si-activation threshold. 

In Fig. [21 we compare the dynamics of a single-stage 
case [panel (a); (3 = 0] and a multi-stage case [panel (b); 
(3 = 0.5]. Both cases use the response function defined in 
Eq. ([T|) with the following parameter values: Ri = 0.15 
and i?2 = 0.3. In both cases, we show an average over 
100 realizations with the same initially active nodes [53 |. 
In Fig. m^a), S2-active nodes have no influence on the 
activation of Si-active nodes (because (3 = 0) and there 
is no cascade: the fraction of Si nodes remains small. 



B. Cascades Driven by Low Influencers 

This example illustrates that a small number of addi- 
tional low-level influencers that are Si-active can also 
trigger a cascade. This could represent situations in 
which a company gives out free trials of a product to po- 
tential customers with the aim of boosting full product 
sales. Once again, we use the Facebook network to com- 
pare two simulations. In the first, we set Ri = R2 = 0.2; 
in the second, we set Ri = 0.15 and R2 = 0.2. The 
first case is essentially a single-stage process because the 
Si dynamics are slaved to the S2 dynamics. The other 
parameters are the same in both simulations: (3 = 0.3 
and 0'^^) = (f)^'^'^ ~ 0.02, and the response function is 
again given by Eq. ([T|). We again average over 100 re- 
alizations using the same initially active nodes in both 
cases [sst . 

There is no cascade in Fig. EJa), in which the acti- 
vation thresholds are equal. In Fig. [Hb), however, the 
Si-activation threshold is slightly lower. This results in 
a small number of additional Si -active nodes, which is 
enough to trigger a cascade. 
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(a) Single-stage case 




FIG. 3. Comparison of single-stage [panel (a)] and multi-stage 
[panel (b)] cascade dynamics on the Oklahoma Facebook net- 
work [501 . As in Fig. [21 the time t is on the horizontal axis, 
and the fractions of nodes in each state are indicated on the 
vertical axis. Light blue, blue, and red regions represent So-, 
Si-, and S2-active nodes, respectively. We use 348 S2-active 
seed nodes (corresponding to t^'^-* = (f)^^^ ~ 0.02) and j3 = 0.3. 
In panel (a), Ri = R2 — 0.2, so the Si dynamics are slaved to 
the S2 dynamics. A small change in the threshold parameter 
{Ri = 0.15) in panel (b) yields a small number of additional 
Si-active nodes, which are nevertheless enough to trigger a 
cascade. 



III. SYNTHETIC NETWORKS 

To better understand the cascade dynamics, it is in- 
structive to consider well-chosen synthetic networks that 
allow us to control the increase in the final cascade size. 

We design synthetic scenarios in which some nodes do 
not become active in the single-stage case, but the cas- 
cade in the multi-stage case occupies the entire system. 
To this end, we construct random networks consisting of 
nodes of degrees 4 and 24 in proportion 1:1 with posi- 
tive degree-degree correlations. (We dub such networks 
"(4,24)-correlated random networks"; degree assortativ- 
ity is positive when degree-degree correlations arc pos- 
itive.) We then use the response function defined in 
Eq. ([1]) that scales the threshold by node degree. It 
is thus harder for high-degree nodes to become active 
because they need a larger number of active neighbors 
(most of which are also of high degree). We construct 
the network using the method described in Ref. [s^ . 
where the assortativity is captured by the joint distribu- 
tion P(fc, fc'), which gives the probability that a randomly 
chosen edge connects nodes of degrees k and k' . For our 
example, we choose P{k, k') to be a (symmetric) ma- 
trix whose non-zero entries satisfy P(4, 4)/P(4, 24) = 3 
and P(24, 24) /F(24, 4) = 23. This gives a network with 
a specified amount of (positive) assortativity. An addi- 
tional consequence of the multi-stage dynamics on such 
networks is that trajectories spend a significant amount 
of time around a value that is not the final steady state. 

In the first example, illustrated in Fig. [31 the extra 
influence exerted by S'2-active nodes is needed to drive 
a system-wide cascade. To demonstrate this effect, we 



compare the dynamics resulting from a single-stage case, 
captured by the S'l-dynamics of the multi-stage case with 
upper threshold R2 = 00, with those resulting from a 
(true) multi-stage case with R2 = 0.7. (See the caption 
of Fig.[4]for the values of the other parameters.) In Fig.[4l 
we show numerical results using symbols and analytical 
results given by Eqs. ((SI)"® using curves. The analytical 
results qualitatively reproduce the numerical behavior; in 
some cases, we also observe good quantitative agreement. 
However, as discussed in Appendix [Bl some novel effects 
arising in the multi-stage model are not captured by the 
analytical approximation and can potentially lead to in- 
correct estimates. We show the aggregate fractions of ^i- 
and S'2-active nodes for the single-stage case in Fig. [H^a) 
and for the multi-stage case in Fig. [HJb). We show the 
separate temporal evolutions for the two degree classes in 
Figs.[4l^c,d) and the temporal evolution for nodes in each 
degree class that are 6*1- but not S'2-active in Fig. [3Je). 

In the single-stage case, only the low-degree nodes ulti- 
mately become S'l-active, which results in the aggregate 
active fraction of 0.5 illustrated in Fig. [HJa). Observe 
that none of the high-degree nodes become S'l-active [see 
Fig. [H^c)] and that no nodes become S'2-active because 
R2 = 00. In the multi-stage case, which we show in 
Fig. [ll[b), the low-degree nodes that in the single-stage 
case were signiflcantly above the Si-activation threshold 
eventually become S'2-active and hence exert more influ- 
ence on their neighbors. As we show in Fig. [H^d), this 
initiates a gradual increase in the number of 5i-active 
high-degree nodes (starting around t = 10) until there 
are sufficiently many to trigger a delayed rapid transi- 
tion or a secondary cascade (around t = 13) in which all 
nodes become 5*2 active. 

In Fig.[4l[e), we show for each degree class the temporal 
evolution of the fraction of the S'l-active nodes that are 
not S2-active. The number of such nodes is given by the 
difference between S'l-active and S'2-active nodes. The 
peaks in Fig. [4lje) imply that nodes in each degree class 
first become Si active and that there is some delay for 
their S'2-activation. 

Now imagine that only the Si-dynamics in Fig. [H^b) 
can be observed. For example, suppose that a publisher 
knows which nodes purchased a particular book but has 
no idea about the level of excitement of any of the nodes. 
In this case, at time t ~ 8, the publisher might mis- 
takenly conclude that the product is not going to be 
sold anymore and could perhaps discontinue it. How- 
ever, the heterogeneity of excitement and influence levels 
among book purchasers suggests instead that the pub- 
lisher should continue printing more copies of the book. 
In this toy example, there would be a sharp rise in sales 
at time t = 13 caused by the S'2-active agents (who, 
e.g., could represent customers who write book reviews 
on Amazon). 

We now consider another example, illustrated in Fig.[5l 
in which the addition of an S'l-active state enhances the 
propagation of an S2-cascade, which (in this case) eventu- 
ally reaches the entire network. In Fig.lSJa), we illustrate 
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FIG. 4. Demonstration of dynamics when an S2-cascade 
drives an 5*1 -cascade. Panels (a) and (b) show the aggregate 
fractions of 5*1- and S'2-active nodes, panels (c) and (d) show 
these fractions for each degree class separately, and panel (e) 
shows the fractions of nodes in each degree class that are 5*1- 
but not S'2-active. We show the numerical results (averaged 
over 100 realizations) using symbols and the analytical results 
given by Eqs. ©-(jS]) using curves. The timescales are inde- 
pendent of network size TV, which we take to be A'^ = 10*. 
The values of the other parameters are Ri = 0.2, R2 = 0.7, 
/3 = 0.45, <;6<^' = 10"^, and (/)'^' = 0. (We choose seed nodes 
uniformly at random.) We use an upper threshold of i?2 = 00 
to model the single-stage case. 



FIG. 5. Demonstration of dynamics when an Si-cascade 
drives an S2-cascade. Panels (a) and (b) show the aggregate 
fractions of 5*1- and 5'2-active nodes, panels (c) and (d) show 
these fractions for each degree class separately, and panel (e) 
shows the fractions of nodes in each degree class that are 5*1- 
but not ^-active. We show the numerical results (averaged 
over 100 realizations) using symbols and the analytical results 
given by Eqs. ©-((S]) using curves. The timescales are inde- 
pendent of network size A'', which we take to be iV = 10*. We 
use the values Ri — R2 — 0.35 to model the single-stage case 
(panels (a) and (c)), and the values Ri — 0.22, R2 = 0.35 
for the multi-stage case (panels (b), (d), (e)). The values of 
the other parameters are p = 0.45, and (j!>^^' = (j!>^^' = 10"'^ 
(where we choose the seed uniformly at random). 



the case in which Ri = R2 = 0.35. This is effectively a 
single-stage scenario because the S'2-active nodes are also 
active. (See the caption of Fig. [S] for the values of all 
parameters.) Similar to the previous example, all of the 
low-degree nodes become active, but now they are also all 
52 active. None of the high-degree nodes become active 
[see Fig.[5l^c)]. In order to obtain a (true) multi-stage sce- 
nario, we reduce the i?i threshold to 0.22 and show the 
resulting aggregate dynamics in Fig.[SJb). The dynamics 
are initially qualitatively similar to the single-stage case 
of Fig. \5la). However, after some time passes, there is 
a surge of high-degree nodes that become S'l-active due 
to the lower S'l threshold. They then drive a cascade of 
52-active nodes [sec Fig. [SJd)]. 

In panel (e) of Fig. [SJ we show the temporal evolu- 
tion of the fraction of the S'l-active nodes that are not 
S'2-active. This quantity exhibits a peak for the high- 
degree nodes but remains at zero for low-degree nodes, 
indicating that high-degree nodes become ^2 active some 
time after becoming 5*1 active, whereas low-degree nodes 
become S2 and S'l active at the same time. 



IV. ANALYSIS 

We now present an analytical approximation for the 
temporal evolution of the fraction of active nodes in our 
multi-stage model. The method that we employ is based 
on pairwise interactions between nodes [3 Ell and en- 
tails two requirements: (i) for any fixed k, the response 
functions Fi must be non-decreasing functions of both mi 
and m2, and (ii) Fi{mi,m2,k) > Fi+i{mi,m2,k). Con- 
dition (i) reflects the effect of positive externalities — i.e., 
the more active neighbors a node has, the more likely that 
it becomes Si-active. Condition (ii) follows from the fact 
that the number of hyper-activc nodes should not exceed 
the number of all active nodes in the system. The situa- 
tions that we illustrate in this paper satisfy these condi- 
tions, which are sensible general assumptions in studying 
social influence. 

One computes the density p^'' (t) of degrec-fc nodes that 
are Si-active at time t by solving the set of ordinary dif- 
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ferential equations 

p«W = ff«(p«W,g(i)W,Q(^H0) , (2) 
g«(t)-Gw(QW(t),g(2)(i)) , (3) 

where Q^*) is a vector of auxiliary variables. We present 
the functionals Hj^^ and G^^^ (each of which depends 
on Fi and the network topology) and the derivation of 
Eqs. Q-® in Appendix El The fraction (j)^^'> of nodes 
that are initially Si-activc specifies the initial conditions 
p«(0) = 0« and Q(^)(0) = . The ag- 

gregate fraction of nodes that are 5i-active at time t is 
= J2k=o PkPk\t), where Pk is the degree distri- 
bution of the network and fcmax is the maximum node 
degree in the network. 



FINAL STATE AND TEMPORAL 
EVOLUTION OF CASCADES 
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In Fig. ini we compare the analytical predictions of 
Eqs. ©-(O with numerical simulations of both the final 
values and the temporal evolution of active fractions of 
nodes. This figure uses two example network ensembles: 
(i) Erdos-Renyi random graphs (i.e., graphs in which each 
pair of nodes is connected by an edge with equal, inde- 
pendent probability) with mean degree 2 = 5; and (ii) 
uncorrelated random graphs (i.e., random graphs whose 
joint degree-degree distribution can be expressed in terms 
of the degree distribution as P{k, k') = kPkk'Pl./z^) con- 
sisting of degree-4 and degree-5 nodes in proportion 1:2. 
We dub the latter graphs "(4,5)-uncorrelated random 
networks" . We do not plot the final fraction of S'l-active 
nodes, as it equals 1 in each of the examples illustrated 
in Fig. |6l As one can see in Fig. [H Eqs. dl])-© correctly 
predict the final fractions of active nodes in all cases and 
are in good agreement with the numerically computed 
temporal evolution. 

In Fig.inija), the blue dash-dotted curve corresponding 
to the (4,5)-uncorrelated random network has steps at 
/3 = 1/4 and f3 = 1/3. The reason for these steps is as 
follows: When (3 < 1/4, degree-4 nodes can never receive 
enough influence from their neighbors to overcome the 
threshold i?2 = 5 to become S'2-active (see Eq. ([T]) with 
fc = 1). Therefore, the final fraction of S'2-active nodes 
p^^^ {00) is given by the fraction of degree-5 nodes in the 
network (which is equal to 2/3 in this example). When 
/? G [1/4,1/3), degree-4 nodes become S'2-active only 
when all of their neighbors are S'2-active. Consequently, 
a finite fraction (about 0.26) of degree-4 nodes becomes 
S'2-active, yielding the aggregate value of p^^^ (00) w 0.75. 
We show the temporal evolution of the active fraction of 
nodes for each degree class for this case in Fig. [nib,c). 
Finally, for /3 > 1/3, degree-4 nodes become S'2-active 
if all of their neighbors are S'l-active and any three of 
them are 5'2-active. In this situation, all degree-4 nodes 
become S'2-active at the end of the cascade. 



FIG. 6. Comparison of numerical computations (symbols) 
with analytical predictions of Eqs. ©-(jSj (curves) for (a) 
the final fractions of 5'2-active nodes as functions of the 
bonus influence /3 for ensembles of (4,5)-uncorrelated random 
networks (blue dash-dotted curve) and Erdos-Renyi random 
graphs with mean degree z — 5 and (b,c) the temporal evolu- 
tion of active fraction of nodes in each degree class in (4,5)- 
uncorrelated random networks for /? — 0.25. In panel (a), we 
use the response functions of Eq. fi]) with k = 1 and uniform 
thresholds Ri — 1 and R2 = 5 for (4,5)-uncorrelated random 
networks (blue dash-dotted curve) and Erdos-Renyi random 
graphs with mean degree z = 5 (black dashed curve). For 
the latter, we show using a solid red curve the case in which 
i?2 thresholds are Gaussian-distributed with mean fi — 5 and 
standard deviation a = 0.1. The total number of nodes in 
each network is A'^ = lO**. In all cases, we initially Si-activate 
a fraction <^'^' = 10"'^ of nodes chosen uniformly at random, 
and we average over 100 numerical realizations of networks 
and initial conditions. 



VI. CASCADE CONDITION AND 
BIFURCATION ANALYSIS 

In Fig. [71 we illustrate the relationship between the fi- 
nal fraction of S'l-active nodes p^^\ the bonus influence 13 
and the mean-degree z for Erdos-Renyi random graphs. 
Darker colors indicate larger final activation fractions. 
The final fraction of S2-active nodes is qualitatively sim- 
ilar. Note that when (3 = (i.e., single-stage dynamics), 
there is no cascade. 

A bifurcation analysis gives an analytical estimate of 
the boundary of the region in which cascades occur. In 
analogy to the methods developed for single-stage mod- 
els [3 111], we derive a cascade condition from Eqs. (HJ- 
([3]). (We present full details in Appendix ICl) Briefly, 
we compute zero eigenvalues of the Jacobian matrix of 
G evaluated at Q^^) = Q^^) ^ q ^his yields a closed- 
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form expression that approximates the boundary of the 
cascade region. This approximation, which is crude but 
given by a closed- form expression (see Appendix ICj) . 
yields the dashed curve in Fig. [T] 

For low z values (towards the left of Fig. [71), we find a 

continuous transition from low to high values of pS^ . In 
contrast, for higher values of z, we find a discontinuous 
transition. This jump arises from a saddle-node bifurca- 
tion that occurs as z is increased while /? is held fixed. 
This bifurcation can be followed numerically by solving 
G = and finding zero eigenvalues of the Jacobian eval- 
uated at the corresponding equilibria (see the solid curve 
in Fig. [7]). We provide full details in Appendix [Cl 




FIG. 7. Two-parameter bifurcation diagram of p^' (whose 
value is indicated by color) calculated from Eqs. (O)^® for 
Erdos-Renyi random graphs. The mean degree z is on the 
horizontal axis, and the bonus influence P is on the vertical 
axis. The first threshold is _Ri = 0.3, and the second threshold 
i?2 is Gaussian-distributed with mean fj, = 0.8 and standard 
deviation a = 0.2. The initial seed fractions are 0^^^ = 2 x 
10"^ and <^<^) = 0. The dashed curve gives the boundary 
of the cascade condition, and the solid curve is a numerical 
continuation of the saddle-node bifurcation. 

Just as in Watts's single-stage cascade model 0, nu- 
merical simulations using parameter values close to a 
saddle-node bifurcation are very sensitive to the choice of 
seed nodes. A recent empirical study [s^ has highlighted 
the fact that real cascades are extremely rare events. 
This is consistent with the dynamics that occur near the 
saddle-node bifurcation: a few specific choices of initial 
seeds produce large-scale cascades but most choices do 
not. 



node from its neighbors using a general function of the 
node's degree and the state of its nearest neighbors. We 
illustrated that this model can exhibit interesting dynam- 
ics that is not possible with single-stage cascade models. 
This includes, in particular, the interplay between the 
cascades of fans (active nodes) and fanatics (hyper-active 
nodes), where one cascade can drive the other and vice 
versa. Our model and our analytical results can be gen- 
eralized to multi-stage cases with any (finite) number of 
active states. The model can be developed further to al- 
low one to distinguish the level of a node's commitment 
and the influence it has on its neighbors. 

Different levels of commitment and influence have well- 
documented importance on social movements, product 
advertising, and other sociological, political, and eco- 
nomic situations. However, mere observation and data 
analysis of complex social dynamics make it difficult to 
discern the relative importance of social influence, ho- 
mophily, and covariates on the observations ^42j . It is 
therefore imperative to develop new mathematical mod- 
els to tackle this challenging situation, and we hope that 
the model we have introduced in this paper will prove 
beneficial for such efforts. 
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VII. CONCLUSIONS 

Social movements and other forms of collective action 
require an ensemble of supporters with different levels 
of commitment, and social influence can make a crucial 
difference as to whether or not they succeed. This moti- 
vates the development of analytically tractable complex 
contagion models with multiple stages in which different 
agents have different levels of influence. 

In the present paper, we introduced and analyzed such 
a model, in which we dcflne the level of influence on a 



Appendix A: Analytical Approximation 

In this section, we derive approximate analytical re- 
sults for the time-dependence of the density of active 
nodes in our multi-stage model. The derivation is based 
on pairwise interactions between nodes and builds on the 
method described in Refs. [H,!!^. We first consider the 
synchronous updating case in which the states of all iV 
network nodes are updated at each discrete time step. 
We then extend the results to situations in which only 
the states of a certain fraction r of nodes (chosen uni- 
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formly at random) arc updated. Thus, the value r = 1 
corresponds to synchronous updating of all nodes and 
T = 1/N corresponds to the completely asynchronous 
case in which a single (randomly chosen) node is updated 
at each time step. For our model, both types of updating 
lead to the same final density values, but the transient 
behavior can be different. 

We focus on a class of undirected, unweighted ran- 
dom networks defined by the joint degree-degree distri- 
bution P{k,k'), which gives the probability that a ran- 
domly chosen edge connects nodes with degrees k and 
k' (with everything else selected uniformly at random). 
This class of random networks reduces to configuration- 
model networks defined by the degree distribution Pk 
when the joint degree-degree distribution is taken to be 
P{k,k') = kPkk'Pl./z^, where z = J^l^o kPk is the 
mean degree and fcmax is the maximum degree in the 
network. We assume that the number of short loops is 
small and that the network topology is locally tree-like. 

We start by calculating the fraction p^*' (n) of degree-fc 
nodes that are S'i-active {i G {1,2}) at the nth time step 
of the synchronous update process (i.e., after the nth 
synchronous update of all nodes). We thus consider a 
randomly chosen degree-fc node A and calculate its prob- 
ability of being S'^-active at time step n. Because we 
choose A uniformly at random, this probability is pl^\n). 
A fraction of all nodes are initially set to be S'i-activc. 
Thus, node A is initially S'i-activc with probability 0^'^. 
If it is not initially S'^-active, then (as discussed in the 
main text) it can become S^-active after a synchronous 
update with probability Fi{mi,m2, k). The arguments 
TOi and m2 are, respectively, the numbers of A's neigh- 
bors in states Si and ^2 before the update. 

We denote by q"i^\n) the probability that node A's 
neighbors are Sj-active at the nth time step, conditioned 
on node A itself not being S^-active [5H|. Thus, the prob- 
ability that exactly mi out of k neighbors of node A are 

S'l-active is given by iq^j^\n)\, where 



k — m 



(Al) 



is the binomial distribution. Similarly, the prob- 
ability that exactly 7712 nodes out of these mi 
S'l-active neighbors are also S'2-active is given by 

B'^^l {qi\n) /q^^\n)) . Note that qi\n) / q^^\n) is 
the probability that an S'l-active neighbor of node A is 
also S2-active. Combining these probabilities yields (for 
^-{1,2}) 

k 

pf{n + 1) = + (1 - 0«) Y. (^f V)) 



m-i— 



mi 

E BZ\%y^]F.[mi,m2,k). 



m2=0 



(A2) 



In Eq. (jA2[) . we have assumed that the states of any 
two neighbors of node A are independent. We would 



expect this to be the case for a graph that is locally 
tree-like, such as random networks constructed using the 
configuration model. Although this assumption breaks 
down on real-world networks with high clustering coef- 
ficients and/or significant community structure, it has 
been demonstrated recently using several dynamical pro- 
cesses (including single-stage complex contagions) that 
cascade results obtained using locally tree-like approxi- 
mations often remain valid 15711. 



Distance 




FIG. 8. Tree-like structure of a network near node A, which is 
treated as the root of the tree. Only nodes within a distance 
n from node A can influence p^''(n), the probability that A 
is Si-active at time step n. 

We now need to calculate the probabilities q\^\n) 
that the neighbors of node A are S,;-active at time step 
n, given that A is not S^-active. Because a neighbor 
of the degree-fc node A has degree k' with probability 
P(fc, fc') lY^y P{k, k') , it follows that 



E,, P(fc,fcOgi'?(n) 



(A3) 



where q^y (n) is the probability that a degree-fc' neighbor 
of node A is Sj-active after the nth synchronous update, 
given that A is not Sj-active In order to calcu- 

late q^\n) and thus q^\n), we need to consider nodes 
within distance n of node A, because the influence from 
only these nodes can reach A in n time steps. Follow- 
ing similar reasoning as for Eq. (jA2p we express (n) 
in terms of such probabilities for nodes one step further 
away from A (see Fig. [5]): 



.1 



mi— 

E %)7^ i^i(mi, m2, fc) (A4) 



and 



(n + l) = 0^^) + (l 



fc-l 



2) 



2 ^mi i gfV) 
(1) 



E<T^(«f^H) 

mi— 

1 - (tk^{n)^ F2{mi,m2,k) 



+ q^ {n)F2{mi + l,ni2,k) 



(A5) 
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Note that unlike Eq. (|A2I) . the sums over mi in Eqs. (jA4[) 
and (|A5p run to toi = fc — 1, implying that a node that is 
not part of the seed fraction 0^'' can only be Si-activated 
by its — 1 children at the level further away from A. In 
Eq. (jASp . one must also take into account the possibility 
that the parent node one step closer to A could already 
be S'l-activc with probability q^^\n), so its last term is 
different from that in Eq. (|A4[) . 

One can write Eqs. (|A2p - (|A5p in concise form using 

vector notation Q'^^\n) ~ q['\n), . . . , g^*-* (n) : 

P^^\n) = h^^^ [Q^^Hn-l),Q^'\n-l)) , (A6) 
g«(7i)=g«(g(i)(n-l),Q(2)(„„i)) . (A7) 

Starting with Q(*'(0) = [(^(*), . . . , 0^*)] , we iterate 

Eqs. (|A6P - (|A7p to obtain pf' {n) for all n and k. Because 
node A has degree k with probability Pfc, the aggregate 
fraction of Si-active nodes at time step n is given by 



We now consider asynchronous updating, in which only 
a fraction r of nodes is updated at each time step. We 
choose time step At = r to have a common time scale 
for all T (including the synchronous updating case of r = 
1). If the updating is synchronous (r = 1), then the 
probability Q^"^ increases by AQ^') = - 
Q^*). In other words, all nodes that are available for 
activation arc activated. In the asynchronous updating 
case, AQW = r (gW {Q^^\Q^^^) - Q^'^). Therefore, for 
sufficiently low values of r, the temporal evolutions of 
Q^^^ and p^*' can be approximated as continuous. This 
yields the following ordinary differential equations: 



(A8) 



which are Eqs. (H])-® of the main text. 



(A12) 



(A13) 



Also note that Eqs. (|A2p - (|A5p can be simplified for un- 
correlatcd configuration- model networks — i.e., when the 
joint degree-degree distribution factorizcs as P{k,k') = 
kPkk' P^,/ . In this case one obtains 



k mi =0 

/ ,(2) 

(A9) 



/ (2) ( \\ 



m2=0 



kPk 



fe-i 



mi— 

xFi(mi,m2,fc), 

1)^0(2) 

k-1 



1 - 



E 



kPk 



X E Bt-:U'Hn)) E S 



mi— 



qW{n) 



X [(l-g(i)(n))F2(mi,m2,fc) 
+ q'^'^Hn)F2{mi + l,m2,fc) 



(AlO) 



(All) 



Note, however, that Eqs. (jA9p - (jAlip are inaccurate for 
networks with degree-degree correlations. For example, 
they fail to predict the cascades illustrated in Figs. |4] 
and [5] 



Appendix B: Additional Features of the Model 

Our multi-stage model possesses several additional in- 
teresting features that we did not discuss in the main 
text. They motivate the development of more accurate 
analytical approximations than the one presented above. 
The reason for this is that the above theory, which we 
generalized from previous work on single-stage models 
and other dynamical systems [3 SI] , neglects some ef- 
fects that arise in the multi-stage model. These features 
are either minimal or absent entirely from single-stage 
contagions models, so the need to develop more accurate 
analytical approximations has become apparent only be- 
cause of the new model in this paper. Below we highlight 
a couple of examples of features that we have identified 
as requiring more accurate modelling. 

The first feature, which amounts to believing one's own 
gossip is similar to the "June bug effect" [59| from soci- 
ology, except that here the feedback is of primary impor- 
tance. This feature can be understood by considering a 
node that is initially Si-active but not S2-active. Sup- 
pose that this node Si-activates some of its neighbors at 
the next time step. After that, these Si-active neighbors 
can in turn S'2-activate the original node. That is, a node 
can become more active because of the feedback influence 
received from the neighbors it had Si-activated in the 
first place. The analytical approximation that we have 
employed docs not account for this effect, as it assumes 
that neighbors activated by a node are distributed across 
the network (rather than gathered around the node) . The 
ensuing theory thus may underestimate the number of 
S'2-active nodes in the system. 

The second feature, which we call "state segrega- 
tion", leads to theoretical predictions that overestimate 



10 





(a) Single-stage case 




(overall evolution) 


yT Q i'nod.s 


-^z"-'- 




(e) Multi-stage case 
S —S degree-4 nodes 



.nO.<t.>Ji»g . •'■'»'°-.no.l>,n..,o„..n..,n,.|,ni,|l 



10 15 20 25 30 35 40 45 



FIG. 9. This figure is analogous to Fig. |4] of the main text, 
but here we use a different value of the upper threshold R2 
for the multi-stage case [panels (b), (d) and (e)]. The value 
of R2 = 0.8 is used here and R2 = 0.7 is used in Fig. 2] 
As discussed in the text, the state segregation effect is more 
pronounced for R2 — 0.8, which implies that degree-4 nodes 
must have all their neighbors S'l-active in order to become 
52-active (whereas only three Si-active neighbors are needed 
when R2 = 0.7). Our analytical approximation does not prop- 
erly account for state segregation, and the consequences of 
that can be seen by comparing panels (d): the match with 
the numerical simulation in this figure is clearly worse than 
that in Fig. g] 



when the first S'2-activations of degree-4 nodes occur. 
Therefore, the theory assumes that there are at least 
some connections between inactive degree-24 nodes and 
early S'2-active degree-4 nodes, even though there are no 
such connections in the network. As a result, the theory 
overestimates the fraction of S'l-active dcgrcc-24 nodes 
in Fig.El^d). 

The consequences of state segregation become less pro- 
nounced when the S'2-threshold is reduced from R2 — 0.8 
to i?2 = 0.7. The degree-4 nodes now become S'2-active if 
at least 3 (rather than 4) of their neighbors are 5i-active, 
thereby relaxing the restriction of not being connected to 
any of the degree-24 nodes. One can see this in Fig. SJ^d) 
in the main text, where the agreement between theory 
and numerical results is clearly better than in Fig. IHJd). 

Finally, we should recognize that the multi-stage model 
requires the use of a more comprehensive set of prob- 
abilities than the probabilities g^*'' (n) defined in Ap- 
pendix [A] For example, strictly speaking q^"^ can be 

used in Eq. (jASP for calculating q^^ only in the case of 
a direct transition of the parent node from state 5*0 to 
^2. The correctness of the use of g^^-* for calculating q'j^'' 
depends on the extent to which the fact that the par- 
ent node is S'l-active will affect the probability that it 

will become S2-active. In many cases, the use of q^'^ is 
adequate. In general, however, if the parent node is al- 
ready Si-active, the probability that its child is Si-active 
will be higher than what is given by q^.^''. For this rea- 
son, the analytical approximation given in Appendix |X] 
can underestimate (or even fail to predict) the cascades 
of activations in some parameter regimes, though better 
approximations would necessarily be more involved. 



Appendix C: Cascade Condition and Bifurcation 
Analysis 



the numerically observed fractions of active nodes [see 
Fig. [Hid)]. In this figure, we consider (4,24)-correlated 
random networks, and the theory predicts that Si- 
activation of degrec-24 nodes happens earlier than what 
we observe numerically. We explain this as follows. 
Degree-24 nodes begin to become Si-active because they 
are activated by their S2-active degree-4 neighbors. How- 
ever, degree-4 nodes first become S2-active [for the 
threshold value R2 = 0.8 used in Fig. IH^d)] only if they 
are connected exclusively to other degree-4 nodes who 
are all Si-active at that time. Degrec-4 nodes do not 
receive sufficient influence to become S2-activc if they 
are connected to any of the degree-24 nodes (who are 
all inactive at that time). In other words, there is a 
negative correlation between a degree-4 node becoming 
S2-active and that node being connected to an inactive 
degree-24 node. Our theory does not take this anticor- 
relation into account: it assumes that inactive degree-24 
nodes are equally likely to have connections to dcgree- 
4 nodes of any state. This assumption docs not hold 



We now derive the cascade condition, which deter- 
mines whether an infinitesimally small seed fraction of 
activated nodes triggers a finite-size cascade (i.e., a cas- 
cade in which the number of active nodes scales linearly 
with the size of the network). We present a derivation 
for uncorrelated random networks with degree distribu- 
tion Pk [l^j. It is sufficient to analyze the dynamics 
of the auxiliary variables q^'^\ which are described by 
Eqs. (|A10p and (jAll[) . To avoid restating lengthy formu- 
lae, we rewrite these equations as 



The dynamics under asynchronous updating can be ap- 
proximated by the ordinary differential equations 

^W^^^(0(,(i),,(2))_^(0=G(0(gW,,(^)) , (C2) 

which, written in vector form, results in the two- 
dimensional dynamical system 

q^~g{q)-q. (C3) 
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Because the equilibria for both synchronous and asyn- 
chronous updating are the same, our cascade condition 
and bifurcation analysis are valid for both cases. 

In analogy to the methodology proposed for single- 
stage cascades [13, SI] , a small initial seed will grow if 
one of the (real) eigenvalues of the Jacobian matrix of G 
evaluated at g = is positive. This yields the following 
condition for global cascades: 

D2.9rDi.g(') - (Di.gW - l) (d^.?^'^ - l) > , (C4) 

where Dj^Q*'' is the partial derivative of g*^*' with respect 
to q^'^^ evaluated at g = 0. The partial derivatives are 
given by 

k ^ 
X [Fi(l,0,fc)-^^i(0,0,fc)] , 

k 

X [Fi(l,l,fc)-^^i(l,0,fc)] , 

k 
k 

X [F2(l,l,fc)-^^2(l,0,fc)] . 

Note that if = 0, then F,(l, 1, k) = ^^(1, 0, k). There- 
fore, D25^^^ = V)2go^ ^ and Eq. JCH) reduces to the 
single-stage cascade condition derived in For net- 

works with degree-degree correlations, the Jacobian ma- 
trix is of higher dimension, so the eigenvalues must typ- 
ically be located numerically. 

Although the derivation of the cascade condition re- 
sults in a closed-form expression, it is a fairly crude ap- 



proximation. This can be seen in Fig. [3 where the area 
of parameter space in which cascades occur is larger than 
that predicted by the cascade condition. We will show 
that this error is in part due to the value at which the 
partial derivatives in (jC4p are evaluated. We observe 
that the sudden onset of activation cascades is typically 
associated with a saddle-node bifurcation, which can be 
accurately located numerically via linear stability analy- 
sis. To this end, defining the small perturbation ^ = q—q* 
and linearizing about the equilibrium point yields 

e= [D.g(g,)-/]e, (C5) 

where Dg(g*) is the Jacobian matrix of ^ evaluated at the 
equilibrium point g* and / is the identity matrix. We will 
denote each entry of the Jacobian by Dj-g; , i.e. the par- 
tial derivative of g''^^ with respect to g*^^^ evaluated at 
q = q*. The equilibrium point g* is unstable when one 
of the eigenvalues of Dg(g*) — / is positive [6l|. In con- 
trast to the derivation of the cascade condition, in which 
we set g* = 0, one must now determine the equilibrium 
point g*, which is typically only possible via numerical 
computation. To locate the bifurcation, one must solve 

^g'^^ {q^\q^^) , i € {1,2} (C6) 

for the equilibria and 

D2#'Di.g(^) - (Di#) - l) (D2.gl^' - l) = (C7) 

for the zero eigenvalue. (Note that (|C7[) is similar to the 
cascade condition (|C4|) . but here the partial derivatives 
are evaluated at the equilibrium point g = g*.) Because 
the equilibria give two free variables, one must use one 
of the model parameters as the third free variable in or- 
der to determine the location of the saddle-node bifurca- 
tion. (All other parameters are held constant.) It is then 
possible to use numerical continuation to trace out the 
bifurcations as a second parameter is changed. We show 
the results of this continuation in Fig.[7]of the main text. 
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